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■ T^^e£en£:^ 
rn¥Mlb^1te|^^ 

with) n \-<anyn : R]^S'^ ; ]• > ! : '\: : • v 

In spm^ -hpj^n^Mion^ s^:h »g *P&teft* " re ra ^qi jtrcwi^' ^spfe ^I^b: «e fl fi:^t t6 rf ritta^ J b tosr 
transmission syst&ms^ u^s ^6e.ch,:imp; :a^ 

corresponding elec$oaj $f|^e0*;^^^ 

user is not speaking there .>vj II constantly be- an output signal from' the microphone 
corraspo rrdihg^rto - i :6ll%nce^b'W$a'c^.p)Und : ' noise; iWde* jfebf prayeriWunnecessary. 
. . pr ; bcess1n£pS& 
Iba^ofee^and: (fii|v(o ; increase overa 
■ delecfion clrcuite wKi^ 

wHicn onl^ac^^ IMthe 
Inco'rftlng iigaal^ 

Mb^fgripr ^ end .^M^j^^^Ui^rvi^ 

Surtog- W&fy-W& in* p6H?cVllfe:ici M .^riye^ftpl *y#eira. ^s'bcff ft " 
.detected;. by- ^rri;^Hng%iB6..$vqrttg^ ©riergy s .ijvi1fi ^lHce«Kdl'd'.a^.d. w^fng=;ifpr it <tq tie; 
exceeded jndipat ing -tftat sp.eecH h.as 'the^: started.: I n order for this teo)1nlc|Ue'\iQ% 
Obl^'- ta- .^jtfrtLjiiBiy ,dBtanrtiji?;1he- point* afcwhicfr *8# 
palled end points), the thr^dld has t&ba seUp a yalge near : the nQis^-:f[o:bYvThl'S 
system -^^W@l:^^#|^EUP^^ e;jo;y$^^ 




qwetfe^portfonrote^ fee . nobe: feyel can cause nciWto be 

..^eWcted^a^'speech^^.' ! r - 
:bhe; &i m of; the: pr e.s ertt . indention 1s,:tg .provid e^an ^altera ative -system for dele et% 
speech^ j . j i- ..! l ir !.€- :?f : . ! V. : = : 

Accar^irT§ 1p; b h feasj^eafe ftfe p^s'en^arr^pti b^^Mrie^ a ^eleit- t&dQ$ ntf i bp 
apj^aiu^cor^ 

detsctihgihe pre^ence^rspeectiMn^^rnfi^ serial usrng.^He^ltefed. eTriengy^Ignai: 
Sujc^ ^^ul?: "HeS f ^$e1jB£f the-^ese r#f$ : SR^fii 

rnor& ^ ar$!Cdr^^ ? 

pp^iDtB be:cai« 

;1H£) &it(p^ 

^rcording; t^ ^npth^ a^P^ trgK^e^ irt^#Ip^ IR'PI!^^^^ 
^elermjningJh^lpcation of-a boundary .between a speech: containing. pP,rtj on arid a 

■ '\ r- 1 '^, 'v.!-; k;.-^ if" :: .f • . 
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backg/Dund: 'noise con^ 

co m p ri sin g:~ m e an s >fo f tbcb ivlh.g^th 9 , ] np tilislg n a!> m e an 3 fo p ro ces sih,^ the ; rece iy ed 
clgnal to^Qnerata ah enlBtg^i^nd^rrtB.eri^ftfr ^«.iw^htn^;iW/BkBlihoorf^'hat^He 
■ b ound'ary tfs Idcatsd. at -each a plur^lit^jfcpo^ 
8[gnal;,anWe^ 
ideJermirio^ 

•. ^ •(^^ jSjiaiy empsfliiini^ . 
Ffgute:$-;#^ 

Ft^n3;4ie a # ii$ti$\ 
S[\) into ^serie#*of tiipe-fr^as 

Figure £^ 

.itiWB'vfTOm^v' "v. . \ '■ ': - : ; ■.■■ L ^ : i.; .r'i : ;>i : /// ; " V«.!~ slC-'!; 

Figure '6a. 1 a a^plbt; of tire- ayep^ga frame- .energy: <6f an Input .epeech viignafc 
illustrating 

• :of ipeBcn v«f "iff* in " 

Fia/ina>3b:1s.4^^ 

• Figure J> a\te^^ 

unrt^hoWh. in- :Fi^ure 7^ 
ftg^n^ 

^^y^:lc^k;^.qU©n^: v&^^iohs^an^rthe DC; o/faatj: ; ' 
Figure & fe: ^ 

*he feature extradibr^shown irV^gure^. 

Figure if Is a dlagmmma^cali^epresBnlatifan of.^lhe*.^e9.n1t"ucfer>fes r pon.ser cfethe 
•<«9crBts BouiiWran^fprm df^he» ep8eehiJgoaJ^9how4? ft^0]ajur.e\.%, ; 

/Fi^Ura/}^ 

.Qutpul'.' Df a ^ Q| seale ^ ll ^ r : " : : !;=.;•■■ *' l 1 -, '' I'vV •• . '". 
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' • • _ . ./ tylWI APPLICATION 

/. Figure^ 

output from ;l he met ^ca(^,filj[&r b^nk), . . , \ 

^ifiin: the inp^t fr.e^nae (^sfctea'd os^6r- ^^j^fr^S^^ fefer 

Fiigure45a|s : a plbfcofr'tilB' .lo^majgnitude .«pedtnjm\o^tiii9^j3:a#r6rh Jnavrttel'scaTe 

•filter bank foran/*exam-pW : ^Qrd : vw:h8n'4hBpe v i& ti&^-^^Kl^wd; .:ridise|- . 

Frgurelo'b )s a pbt : 6f^eriog y m^nftuti8 spedrum of Sheioijtputfta^ 
filter bank, for ^he same-word! wrb^M^ 

Figure (5c shows the plot ihowo'Jn Figure i5a; When ^nois.e tp asking, level is 
' •• ' • ;;a#fe"ci4o "tffe ;ji$pi froVlhe^mai scale filter-bank ; ■ 

Fig'Qre & ^. 6i^f00^0 mprN^ntai^ # ths: fgft^^VfrWI logged;' 
niagnitu^^^ 

Figure: 17 js a plof.illustratfhg a: nonlinear tran-sf.ormaj,ipn usecHor scaliha/lhe binary, 
tfalugs r$prB^ p|dej^tp 're;d£r^^ piuj^^rM 

.bjt*u % sed.to represent $|m§ 

utterance" of an axaWpJ.B v Word Jn. ytfiicMhere is little t?aqkg,roond 

figure. Js^eb^matically- shows .the way; in which the energy Jevet varies in the 
utteiwce ofthe sewe^^ 
■ /bati^roand^ialse^- ' ■ 
figure lfe-^Aem^t^l^-.^bjov^- *hjB*janerjy levelsr-sW.fc faiWfam? • ifi> arid f8b 
:after energy rwrmfciMtib^^^ 

Fjpr>tf 9a scftemotibally shoW* Ivyo u(tai*nbe&bY : $ use j to 

■ Figure 19b sch'efratf^ an ^Verafee, of a Irattiing.: e*em)jlf S#t#.ftge 

. qscUi$i^ cau^eTd B3jl^^lSS*^Sl'8j^?S ' 

. .^i^^lSi^' • - ;: : -' .' ■ i : ;:v/- :; :';'^.;' : : : ,!:'■*;. \' ■ J;^;: V ..[J.. ::- . 

■woryv^a^^^tr.i^t-vjpnj §hd 

Figure lSe: achBiri-atJcally shows !W) utterance of a t.ramiag>Word havjng ; ,^, large 
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, - .RAT ^^tl! &T LON 

bat tK&'6mbb$ l^.implemUrite^ in s^tw$r^h!ch i^njip^jn; 
•co nja net (a n ; wt^ -par^jcijn^ • pptrt|iU|ie^^|^^;. ; 

;phpVqcp^^ 




M*en£i.^^ 
sfrr^de^^ 

Th^^pp^iori at'p* 'sp^ji^-X^^opi: $9$$$ Q^bqd\m^;W\ now p;e 

briefly described with, ref ^re.n.ce; ^v : "FtgHre ; . .2, A 

spfc.epr^ ■ 
applicatibn £^ Mterafy ^rpo^-B'ti^^firawft 

Electnce) signals representative -ot Irie Input, speech; fromy : : fbr ^.example^ thev 
.microphone^-? ^re^pfieb^b ^preprocessor-^ 
slgnal-lnto a se^ 
V fraW'P&fis^ 

, via buffer 16/ to a reception 'block 17 v/herec trie, sfteech Is, fecpgrnsecf by> 
cpmjpanrtb, ft&ibpi^ 

models t£. each mociej fcpm p'fiitrfe a sequence of>pe>e^er L fr^e;8 ex^ssed ip 
the same Hind bf .p^^ieirs: &&Mo$* bf:the jniut:*^ - 

. cecp^hltib'ft: -^c^v4? : #- ttf ttft t*m*Wr> Pft^$ W^M^f^' 
. recces ehtatVe'#*i * 
' single .^^etelf ii^JolW* ss^^j^^nhose pfi^lr^i^ 
recognised TheVlengutec/e m;q$lBi:$2Us tq:P°JX^ ret n i^^^?!:#^W c ^ 0 / 
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PATENT ^iflTiq) 

A m pre ds^le^^t^\(D 0 will m$$&#^ ti: 6bm;e*6T .th> .apR^ci^lblq:^ 

f hefunrilbnff ^lfith©> pi^prpcssspfeis^arBtfo exirap&f % 4nlorrntttipn rag A 
•spaec1i^ahd4p- db pWcessed^Ther^.ers ■• 

many -different types of i rifornrmtfon w hlc^^ In 
thiWambodlrnarik the preprocessor ; 1&1s dBs\gn8&~to ex'trad 1form:anfe rstatBS 
ihfpWS^iQhV- ^M : arti)8.vare' defined as bswg Me ra*6r>aM:fra^ 
trad of the :.user # , which, change . as the shape.tof the vo cat tract . ch a'nig as'' 

Figure!; .3 shpw^ k bTbck di agpgrft of ■s oft $ of-jh^^ 

jthelp^ Inpt* #eM> -^f^KS <n^pgp^^ 

IJM*£ : % £u^^ the fnpuj 

for fpftflft ffcqMloti > co-ntirnecr.n the freque^^ 
4K.H2.THtf reform filter. blo'cY 51 ^ 

^ffcj i$ jnJdYtti^ useful fbj Sj^^, Vb;^ fifterei put^.by the 

• filter ;blp.ck % tfte£B># :a&4$**S re^gnitrpn prilprjTian'c^ faH^it .1^ ^s^ : e 

. hpis'e ra'ijp 





sp.er; s8cond'< .In it his embodiment 



utterance is cor^erteimtoAfg;^ samples and stored ih.avb'ufieT.,^ : 
■to lh^*u^ 

^ter tbe;,ihpui spe'ech--has. :beeri s^|p3 ft rs\a$iaW^^ 

If n$# fr^fe;ie-b!p^5, fi^^on *>r m0VW°* bt .the lirtp.ut : ?p»ecg Sfe 
fr^rnes'wiJ'l tfbVbHe?^ 

speech, the fbrminl^^ ctiB/rigeS ^ntindovikiyi the ca'te> :of ch^ge. 

.peirig tf jr%# rate pr : mb\£mig:n$ 

1iriiimd*y : pfi^ 




speech Thete ar> : twp;w^ addr| ; s^.d^hBn p£fpnf>jnq $ 

speech, signal, and (ji) how large a time frarn ; e-sfiP|)ld bp^ .u,s|l= 
jh:e, ^t: ; pb ns^#^ 

Ta the frames should be sp#cientV closed en sur^4lia^ 
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missed and^en8or8'iltfll^Qne-1s reasonabre^on^inul]^- ^^The^econd 'eon^iderdlion . 
is- det erm fhecUty -jftHSQin^omfts^Sj^aem)hs^^.e fram a beinej short enough so ihei 
thevSpeecbuslgna^e pr^ and the-irame^Mpft , 

lono, i enough ->fo ^)ve, sufficient- frequency; detail' so IheJb. (h* ''ibr4iant^v'«a>k be 
. '. '■ idisliri^uispB^-;' \ • \ • "' ; '.' f. V ' "v : !:"." ! . " " : • /■ ' V;'i' : ";! s < : :; : :••:»" v ■:. 

ln^hi%^'teg.^1n:a.% iH;bMBMti^^^ WtfrTri. 
the- front end jirpcesslng. ari;d taiejr lrf;3he;:.ra cp^ijljr^^iialg n d n-oy Happing; frahr bs 

from the: speetft wrtJiqOi ^.j^^^fjtiifia) wJirdq^fog J6b^ipn>f>^i§ ^ i [l.u sjtrlat eiqb; , i ri 
Flg#r^v4.^;^ ln^UV?ignS ^^•^ 5^1?% 

signal in^tfq^-pyefl^ 

xo^eiiitionial ,.;^ieiff> . : ; 
: (uritttbri jF^W^^?!^ P^^.!Ei!6l^. i '^Kwq[ICI^ ^|P^^ct|taCil£^lipri^ caused ^:^5*'^in^--1l)^T^M : ^ from 
the Vpeac^^ 

^ frames; •these c$nvefitian1eLl; fy0jqtyvnQ fupctiqri^^ yws&f tefee* i^St} ^pjtwe' 
>re.pp;gniti.^ 

Ttospeach frames Sl^pu^ tf. circular* 

speed^^e^m^ 
detector 

sj^nal begins; ani afjt.e$W(i^.. fie^MifrJIfe^ s^eecii ft iblatectebl v^xjtHin 

the jr^u0i^^ rh^n&£lr:&^ riat-ife'd fe the compUMioH ally 

ihtensh/e- feature axtrtobr- 7a Hbwawft^wfteh th^^ 

b^binning-'bY^peecfi Wijfiin;%e input: s^ha1; : .Hr*!Sn^^^^ 

p^sit\Q \h&SxtitnQ^ ^^Bj^Sd 'aft ej^th'e 8^tf( : .s;p'&Vch p 

70 Which: then attracts o i s>t iW^^tW frHSacivfra^ 



The^%i£^^ 
#es;crib^ 

J^iri.g^ ^reg%^r^ SjWW $A 

laokr.ng- for modulations w)\hin that-samp.lBd:S)gn^ ^aj$et^ 
ln R^lcular,;the energy di^ajtb^ 

4Hz b with vary little modulation' below IHz^or above iotfz In ciortCia\s1>, ctarj|'e> in 
noise level tend to- occur relatively slowly/iypicalV modulating the signal 8nergyat 
less:than lRzHn;add]ltan, -ra^^ 

from tram e to 'frame and are,spraad ovecihemb dul alio n:-fre que n cy .ran^ a: f rom ^ Ff z 
to half the frame rate Therefore, in this, embodiment; 'the arjdpbM detector/ 68 is 
drcaiig-eoMo^ 
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-energy m :a.irequen(^/band between 2Bz end fctej by calculating the modulation 
power withfim<th^ sap p'^ing-.a: . detection threshold ■vip ^the 

•calculated raodufatfon^ppwar 

Plgu^J^Svis; fc-ploT^ if rams e^erjg v £^^ 

co'rri.spoHteo backgroMni) nSISeV&id^hfcn, bound a speech containing pu^ion 7a 

■M^r'/I ;■ '- ..}-.■- ".. " \/ ' :-\ • , 
V^r'm^^ri^-'ib'pVfV the prior ^en^lnt ^etectbrs stnrpl/ l^sho!^ the mm 

Shown in -figure:. 6fc;^-b$e*t^ ^^e^vppjnt^§^^nt^ 

end of spee^pota^ pi^W^c^irtrfel^ 

tlii :threshojd;^u6;im level. As #:ps ^sKjj I e^l^thg^^ill 

apprecjc^e^n c;pj^|rj^ feyel. 

clianges^ $j^;^jid e.ncj 

. '. pcrtrife-of speech; '•. -4 : ' • •' :■' ■ ■" ' ■ ■ '■■ 



As - mentioned ebova^ln^fe 

bandpasevMere id b^^^^ of?2toz and BHz; 

and haVfn g.vei peak, respo nse at>about ^t ^a-modutatron power of tha 'bandpass. 




. . .pqHjo'n 74:. tftik WjH. be t)te ; -&aro:e' re;W^S^^^W^?^^ °fc£Wf# within jjie* 

^peX^<^?^ J.he • -co^n^ 

The, Wlay jri,.Which ;^j^ f ^ , fetajajl^ ■^d^nrK^d In this |e^pi[ii?^fni Ml t$fc be 
described in mbrS xi&taiJ Figure^'is ^ block diagram .showing, the components- .of 
th8:vendppint.dem^b^ ^..AMhp^ 
erte^y ejajcui^ 

block. 65 and whit* con* in ^ and' |>u;jpu : ts« \$ 

energy. Eft) oNhe^signaJVite-eacto 

value is caibulateersahci inpfcint&;#e^ bulrer^S| a sequence of • energy values 
defined t?y a? sliding Window otteedvilze end .ending at the. eneFCj^ value- If or^.the 
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PATENT *AE*PLI C&T j ON 

. energ^ln the; gliding window. In parilcule^ 
frame* k r Is;. given 

eorr£spgnc^^^ 
: fdreafcb'frq^ 

'>bn^fi#rrii3^ ' 
detects thfcFt^ • 

-and of the end of speech is performed in -order io ensurs Iherff is mi^mum 
processih g of b eckgrmi nd : isignals by 4hetfe atu re exiracior £0^to • reduce recognifton; 
error©, caused .by? the noise and: ;to jtnpcave. ieco.gnftiofv pertj^tatfce--. jn. jhts^ 
.embodfmenMWs- 

baiG£]fcte;.b^^^^ 

fe-^^^B •:^ /: j : >;::!j : ;-: : - j.[ 

. r^ir£^ ;. : .;' ; 

: tb^rfomMhe-W fn {his ^toftunsWWa -txjHtrol^.f 

Figured In s^ 

if it is nqj, then processing .proceeds to £Jep *JS.3V y^H^ rer t ^er^^pt fp I unit 
determines; if the. . bandpass modulation, power w*. f o r p$ &ffifi\3tpm e is greater 
then the detection threshold Th, frb|n;Hhe/ serial reicejved by thd threshold cifcOit 8^ 
if 'it U. not. 1h;en : p^GjB^:^ gj^rojcgo'clf • t;onx^W ' SS^hsrs^to ' 
sam e 'procedure ^ 
eower^% 

from step- ; &3#o ste'plitl^w^^ 
scorn pared With a 

AMMton m% g&ihm m&Mm mmum wwat &^&^Btao m 
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.milliseconds oftfnput** peach 

1f the- numb er.ro&^'esvtibo^B^he-tbras^ol^ is. £NTAEfv\ is -not arealer/than^the 
pra dete rm me cfc number ^|j|r^ th entth e ;?ramein urn. berWs \ft crem.ente dith ;tfta p Salf 
.and flh; '*t^ 

w M for the .netft frame .. is eb:bye Jhe detection: threshold Tr>: if it .\s> -iherx -the 
processing returhs^b sleo. 'S7r t W.he L tte frje count QNTABV of the ji umber of frames 
above the. threshold (s jncVerAQDted If the b/srid pas's m'.tfd utatlbrr pp$er ,w k /i s less 
than taVthre'shol^ikt #U)r*J&gg • piiices^ proceed* % lo" 'step ^r^herevHhe 
count (CNTBLiWI of'^ja -hummer of consecupve Jcwt^*>lg(w .'thtfMhVesHold Ms 
tncra^nipcl ; ^bseqijeWtry iS£ &iftt? $Sflfte$: 8£ fr£ ntfm bje^f 

•NHiiD^indlc^^ Pftnjj^>^ vy^it Jprjjhe 

thtfsno^^ emfcoolm^ 
to S£ rriilllsecondf^qr input sighiil.. 

tne;rogflVl^ 
• ■ t^evqoujrrts 

to step S5-. where the control' unit : 86 wa%4hrough foe:?acllprtt^ 

for the nexWrame whtch 18= tfbcve the<deWdbn^^ S%*he 

numbe^drcohseculiVe^ram 

predetermined num'ber ;NHLE^ :then graces sjhg. proceeds \o step S 23. where- 'the 
frarpe riUm'pS^ in^etf $2?, Ihe c;ph^;ifoil ; 8^ 

•the 'bj^^iBl^- m'k^$r^*fc aboyV-toe detection 

threshold Th: If.lUis Jriot, tfeh the processing rejlurhs. Jo step. S17, : where the;.coun* 

which are^below IheVdejie.aTon threshold 1s $8 at td zefb ^d$ie Rfocfis^ig rfltoma. 

incremented. Dries the count QNTABV is above WOT^J) ^djc>«ng Speech hfe 
storted^hen. *B#£$0^^ 

adV-h ^ initiates' . the.^ calculation oHhe start of epaeeb, pQipV 'using ^a maximum. 
likeiihpod/CeJculatibn bh recem ;fram'e? The state/.?* th*. w:ntr<r .Mn.i^ 86 i> ■ j|en 
changed 1 to ^INSP'^i- ih^ep 

Therefore, fo^ summarise^ when^the cohtrcl Unit ti$1$ -.fn$te slats MSItiffl^ and 
whence bandpaBS.modulafen .power ilrsU.excead8;tne;.detedlon threshptd lli^ 
- control unit 86 start* ^ above the*thrastal&^ 

number^consecQ^ number^ con secure 

frames betotfSbe thresnbi^exceeWW 

for the tKresfioid fcMexeee^ feeicouht 
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CNTABV of the number -of frames/ above the^ threshold exceeds NDT&T> then*the 
state is changed to .IN'SREE G'H ond-thaietar^pofntla calculate dvusfoo^tecenW^ 
Full processing of ethe^fitAiby^hafleature ejctrortbr 70:cw^then beajmeifCerihe start 
of speech has been .-calculated. 

Once.the start of speeph has been determined, the.cDnt'rot unit 89 is prog rammed 

tb,lqbfcMlhe^ 

^fter the", st^ofjs^^gi: ' ; ft^fcWe# cafc'CifiiiWft tnv*j|f(fc • 

■ pbHtis^ the- ? prbces^ 
S31 shown in Figure >8|t ^hfe^lhp; cbftrqt .Unit thews' td<*eeie^if the- bandpass 
m^ia%>^ If 
v^,(sf!ab.0Va.l^ tfeja^ftfr^ ' 

., the fr^tn^;:cp^(t|eV .fc&Wq^ b^dpas^:. 




/eoritrbi. b.ribS&*che cfeilf^he: num-fib^ >o'ftcons>cufyve#fa^ flhe:. ^nr^shplcl ; : . » 

ex ce eds a p red 9t arm In b d n u m b er NENbj which In di cat es that t he speech rtes ended*; 
In this embodiment MEND; corresponding ^^^milliiseeOn^ 

If the; num ber; ofccb nsecuttye frames is-tess than ^E'NGji? tkertAsp pech-has 'rtot ^ded: 
and . the- proc;es.8lr)g x proceircfjs; to. step where ttie< frame .c'Atfh^^^ 

jricr.em'ehted^ Pw:ces^ri5j' tfeh. ^wd^^jBtS^ Mtete ih.e. cWtoi ; unit. i86 : 
deteNiries -jf jhe b'ahdp.as.s ^m:bau'iartbfi p.6^e'r'.fbf ihe : hex'.V^rama) .is : beJgW,;jh;e 
Mectfon^ 'M ttfeft rtrt? corurrig ^ iKt^jO^V^^r^f 

cbhse.eti'fiyi v frM^^lpW: the" ftefettiop . thre'shold-, is. rjb.et In step S # a ncf 



the coii ht th% h um$^^^ JJlJ^h^^iM^Bf^Ml^nW^. : 

qhesithe number B^S^ $$W» 

ytifrlfm^mq 'f/Q^^iy-p'^fi, ;S45- ttjifjjjg* th^ txTnlrbl unit :|^iift1^e|;^b|^ 
.calculation af^thje ^p:bmt of s^ee^h utfng^#^ 

recent frames The stale bHhe control unit 86:1s t'ha.n #ianbj3d .to^§|t^6#,in 
step S-47 ; QLhct the proc^^irig ,ret^rro to et^> 



.Therefore-: fn^summary, after. -the^'be^ determined,-; the< 

L control ;um£0contin^ 
unt&&Wun1ih#^ Selow^er>dliecllbn threshold 

andwhanr-ibis nurtbettexraeds^p^ 

^cnana^si^ en&ot sDeecri is^caTcutdtedr . 
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^e^ne^ 

jorge^ jikefihob^ls c^ert ^^s e/jd point ^ 
framel? u^dib <li#j%[^ 

:assumad:: .- : .. f J - ! ' " - .1 I 

Rafe rr]ngvtb ^fyurevv^ when theveoritrai unit ''S^$i9n^«^^b^Ktt[PP^ %: 9^.i sfade^ 8 
outputs <nu co ntrol' stgnalion line SSSto ;tha*?bu^jfa^ 

tran s energies to be read -out of \the- buffer 78 and input to>.a.h'lgh. pass filter 90 
The flUer'90 removes }he DC offset fan d^aly ^iWly^eoS^g 'noftelcoh'tdbutiop fe Jha 

filler. is ^sa;cbW£#ir hpfapjve flljeiV #|tft;|p^j^^ 
. . shOy^he^ 

.$a/ As s.hoWri i)i # : 'fi |f prXcl if r offt^e l^e'rg^ (luctujat^s "■aBb&t -^^ifte^0.^ri_iy ~~ -j)r>J^I '^Ir^jr^jciei. 
bOrijons ?2t-4 apd ftejj osdll^^S^StlMSN efe4^ft v t>!f ^^:^^Mii if 
is ^ss^ed Ijfiat; ■ '(fflft^ \he< sij^ce ^br|ipn^ Sfre $j«£e$ :tf^#ef^ vbrev , 
un.^orreraCed.frqrn (nam f$®*' 
£$ajr*# cjfreaetff^ ft? 1 noj^P^b^bt. 



je:n^rf^:\6jQW|. (h; tjie; "92 ' t,aj<(H^ js^h^ po;(Ot d^-^j^s-^|^ ^a^f^g ppinj 

<i;e p% b'eljpg ^tlPjM ^^ft?^S^|^r° ^fe^^r^^^- 

•ail %am«s afer. tWs point- as sp-eeeh anij ra^pj^in^each^ctt ;4he -'a'ea^paied>*np1s8 : c- 

: fram as Hnio> a-noise model ;and each oF the 'designated spBech^ramesnrito a speacn 
m odel tbv^jve-ja^ifeplihobci .wDret7pr4hei^potn^b.diag:4he. end : gdi nfrlftfts p rocessls 
performed for ^©ach bt the fsl "frames Tri ;thre- bute$#and theyone^&at glvesythw 

■ best iifeiihood scb.ce: : 1s. biMhe e^.|bi^;V' '^'y'P^^ 

In this, embadim'&fe era ;'used feftsiej { $a hMse: and speacN 

partions ^afW^nl 



jtflyen pffi : ' ; ! 'V f " .•] ! ,- : "/. T . : •' V • '„■.." 

where; yi is the * high^ass filtered energy ^and: <^.fs :tne aaenceivaftance^ 
;the tikeiihood U that frames M + 1: ta N tare-speech ts gfoerr 

where' a first order autb-regr^ssiye process ^t^^kaj,ab|a^ S^S^Ift;^. 
variance c r has been oMhe : 
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eato-aggre«slvsv model endf In IKw . dn^ocftm'anjp^ oy&e^^QB^dfi^.. usedt Tne 
Lapradan^statisucs^e^ 

/usual daussTenYtt^ -and? teat* %v roare [ robust ^eaUniateS: and raqUfpe fess 
. computation; ^ 

. : 'and L| gives the- UkelfHood joe ;a>itosilibn>^ram silence .(6: Ipeech aff rarri© 
tbeyv^TancSs and ^Jwa; «H(c?i8M^ ^iif^^v^ h'PqH ^^x^ Wl ?b is^liie iMlltioStj}; 




Substituting these ssSmnfewtnto^he liketihooc(;-talung lagaj^hm^. and nee 

-catf&anvte^ Idgvlikelih'ood^o.. b%iftaxim1£e% : -7 

This is calcuiated'Xp^ as 
; $e: ^:a;pnt:f - • [ : , v--.-- ■'. :-: : V v '.v' : :^\:'- ' ■ ■ "•' .: «?' ;'':'■!«!• y : » ?:. 
The ''•sa^rte^ ajgotfam-'if %sed^ cajctflate^the eriji pjf^j^Bevpft^i^©^^ j^^f3^^ 
;data;.j3 tim.et:re'v^&^£#^ 
. frames of silence and jehougR frames of speech included in ^he Vindq N f^fne? ; 
to allay* #*e1iable;rgnb^^ ; 
window si2e (N) to include, a suffice nt, number &f silence, and Sjpae'cn fjrarn eSj.This- 
Is- athiev.ed -by: jaktog ; all3[h'B:^ 

exceeded up cjnlil Uie^htrbl^ has;&ta#B.p! 4 ^ji^5jpiarW^J|t"th.fR. 

16 fn>m : e5 :wftich immediately p^ecede^^ 

thre^bjd. 

bncer'the b eqjhn ina, dftspeech has % een -dele ctect fh er*firsL< speech f -a me is ie d fro m 
rthe l clhcular^bi&^# shown^n ; Frgura;^^d Ihe featura^exWctoc £6^ Figure 
.show* fn-vmore dSfail ^e: pompon enis^ of ihe feature extractor 7Q used : m Ihis 
em'bp^rti^ni : Ms.. : ^wtii fee; ffri^8iBp inHltfe/eaiuto tn^caJc.ulaii'bh of 

the magn1(udo:.of th o discrete FcuVie.n transform (DFft bfthe current fram e^b!pe& 
0. ml M^Mf^ the ftscrete Va^lM if^^SWi 

info'rmatfon is fequlrep^^slnce majtf ja^e^ 

fan e. bumari ■:Wdiip A ^-^*ti% ;ttnjcteisi£ela^ 
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<.omplmg frequency. i.e 4K H 2 ,Arier^8rfgr;rnng OFT, (he spedr^tvis passed 
trough 0 f|%*e/ik- which overages 1he samples, wfthinva number of^fraquanoy . 
bands. ^(U^s dfl inhuman aqtiitory system have ahoyrtttfaK^B' tftitfregtienty 
re so Icitio r tie creased wjtrv jncreailh^fequs ni^ 4^ereiPore|^ Idigenlh hfvicail/ ffp aced 
filter banfe tm ona .jncw^V^^hVrviare^iti'dra^frequsncy' bands irv the low frequency 
regjon conrparetftolhVttlg^ 

bank 8ince : ; a,loganthm icaily /space d filter : b an k .retaihs.rffona perceptUally^eanifi.Qfu'l. 
' IhforVfi aiibrij- ■ • ' ... 5 . \ . J ; ■ J •.*;; • . ,j =• « : - : : . ; . \ ' : : : • : " : ; ; yS s \ 

In (he. phesent embodiroe.nC a mel spaceti filter. tank 69 bavin Sixte en bands ,1s 
used The rhel seals ft w$j KhoWin Ihe s# of speech analysis, end ts *V 
toganthrnibiscale that- atfetfpts, I'd ffi^iS)!^ a 
i.meaf scale;- Fjgijre; 12 shoves. trve; p^^if |S K (l^|;jof '$$?rr$i' sniped fifte^fejarflclBSt 
yhen-trt.e> ^^pl^^qW'ft in tHfetf^fl^ The 

• resulting riNft$ttf ^ISiii^t^^ 
the averaging effect:bf :jhe --^^Jfi^^lthbqgh le^oVagt^^ 
'due id the. 'liq^nthfnrc ^$ui^Pf |h^' filte ^.^nTt: 
Tbt;fe:rrft^ 

rt48vpps$i^le 'to rrcode I Ihe eg'eeth signal h§) ^^ (^r:^0m^ ^^ excitation 
jsfgpa! E^t) and a tilted V(t), where- the excitation signal E{\) represents the airflow 
' ente.flng'thg^vocai 4rac\*and : lhe*fi!te rV|t|trepresentsftrie -•filirationvBffect-ontne:-vo.cal- 
lracL/§bnsea. u ar%£ Ifra magp^^ off the speech 

signal? la. given . '^-Stke;rimijiitpf?6^^n:^^fa e miagnftude dfcvthe* frequency spectrum 1 
]E(f)] .tffrt'be- excitation signal wtfh &Kwa^nltadfedl$t1i8 sp&ctrurr* tylty] [;#iheiVOco1 



^'rect- filter,, i'e;.; 1 ; 
QrVe'methibd; fcftowri |&Mi#^^ 

from tKejtfh'p.^ "This; rfteJNo.ci 'Ir^^e^^pa^in^he . 

• ttdfrtfact fifter/m^ni^ 

. [E(f)[ by taking the logarithm^ the ^906(1 rffagdIJaS^^^«^p1^srfJ&itel/M^6JtP tiftlt^ 
in -.the" exdtalldrf : $nd' Vocal tract; ^filter pharrihanstjes becontfrig, ado'ibve; 



Figure; 13 spoytg. JhS $0VS Iq^^^ijh^lpdgea ,pulpU[ . front: the iftp|^J^^oji N ^|' Hai 
log \S\()l which shows gW.:hlc^ ti^Tjadwo ^plrl«p<) ^^pr^^l P^- : .$i?.ci- ■ 

103. Component^qj (i--r£Rfc^ lie iq^i^iOt 
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• • ; PATEHT^PFlt'cATlbH 

and component® Js;:^pr^^ fog ; )!^^ 

"Phe-paaks in.component^oimccu^ 

the, equally. spaced peaks in component lW .occur afcthe harm o nTc frequ enctes.- ;0Y * 
• thft jpllch-otetlie spaaioar*- "' 
• the^oWlr^^ i 01vCan.be, e^trQct;ed fro.m th e exci^tlorr characteristics 

:dynomic noise, ma$J^ 



Tha^se w Befell ifffi£$jfif\X$$ 

calculating the ; maximurri-iog filter^n#ien^^ 

Bgufe;4f^ 

Jlhis e)(amp|e/thi5 18 ^second filtfef tjarik.,^ its value, is stored As m&mt* Th;$ 
noise masking block 73 nh en'- determines a minim-urn log filter bank energy, rrmr^ 
by subtracting, a predeffned rah g eV (cri q ^«ryd< ©jrnpiKQ^ly fp^ndr fpqm tr^pfrTg^pe bpH; . . 
from the maximum 1 log filer ban%enef?o^. determined $or%the .current fVame>,;i e Ihe 



tiblse masl<ing. bib^-Av.^determThes^: 

Finally. thewolse> m^s>iho^ 
: ■ ^6w=#:eJ^^iial : to^mej^ Ilrie : r#as^W^ Spamfe;. 
hbjse ^ . 

Figure l$e^h6>*s^ 

fsv little;' tibi8e^A3f#p*ja::: iHe iei^^l:;#riergy Was. tfiipee^ . 
spai?e^ ^t ^TOhe fr^u^rrcy a^p^ Rpra ^$srio#$ th^Mtf m^J ^^^^iHe 

100a ai^ 1<^#^^ 

signal because of Ihe -additional noise in the pigqr&r^b signal *a : ' mlsre:c0r1'iiiOn . 
;could be ma'de However; by, defining a noise-floor whn-Tafe/enqe^ to t^gfeak/lpg 

filter -bank- energy of" 'the; respective, irame,, it; is possible -to reduoe sucrr 
m precognition errors since peaks 1n the- log (liter bank energy w.hb>T may; be close 
tc4he noise floor ^|and hen rerarry pied by ij) are.^BufomatTcally m^ 
taken into consideration dunhc^ the mot cfilhg. process. THis is iljustrated::in' Figures 
15c and 15d, whteh show, the looker bank en 

respecliveiy when iihe^ :dynami^ 
peri^med*^^ 

" . '•' ■ •"' ^1*3 • . : -.v.= ':X : .'"'-V ^•v/'" ' ^ 
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signals outptrt correspond:.mofe<cloBB(y.ev^ rib'fefe. . 

the concepMDf no^ss masking Is nut new, (However, ih, the ^temis^proposed ^to - 
, daJi^ a^onstehtm^ 
^a^iS^^o^^is Sail. W &B&e t fie 'amplificaVibo jaftci l^^a|||fe^|5t aacfr 
tfarrta isi^e ial&W 

Howe^%is^ .= 
(AGC). at lh& ;}ji^ & aiffSftftt ;gojrf to each .fafteck the fnput 

speech; sfhce WP^WWS^® ndt;:Rn;p;Wii, V/jth theid^^.id^adise 

maskin^^.^©:^ a j^i^ jfdMM^to't figph; 

• ■ -fraiftfc ;m:nt^ it- does not rh aft ef - wh" at gainVhevc been 



Returning ,lo^ Figure W; afterthe log iil^rba^K .ener^es have Ijeeri maSj^d' bf ffifr 




sra;^al^ie^n^« 
provide s-sbme^^^ atandarrf S§7V • . 

Figure jSshow.s. the outpuV-of/tfce DOT block 7$, .which fs.;knawn as thexCBpstRim 
Cfe"(m);. The Independent variable fc-r«xis .. . of Figure ify op the- cepstoim <has,, 
^imen^bns"-^ :: 1IlTie ^nfcis. given the: name "quefrahoy^ . 
component ioi snbSirln Ffoufe t& become* /peak LOS. in'.. I he . centrum at a 
lo'ccitioh'- ^jqUiyaf.e;nt' 'to 'the pitch ^e'no.d' T %i, -the' '"£#e|#BVM3 ' ve£#nC3 ... 
convpApeht^ ontb^&^uWSe? bfi^rfail p££&j • 

lO^ee/Mhe gHgm of the cepstr^rn, 'the tuitions and amp(ittJd^\oh l wh t th,:ard 
! jdepe'ride^^ * : ' . ;v ■;' : . 



m 

%etren cy, scale, >th ey/cart bef 8 e^ra^f fro;rn ^.:^her% / 

frfilterth^ 

cepstrCim shown- in ;Figyj& 16 is^mfldeH^p;;: £f;# • $^0-$0$L$ 

cp.effld8nts' (§<v Cji, ... ?4^i^3>||,r6f^ Mftfi^ng^^JQH^ 
of a simple .rectangHiar^ndoW:, tfqweyer; irVoj^^ 
speCtnjm^a^jarB o$^s|pii^ 

is preferred !n:;the preiserft - ^ the^followln^;^ iisedln 

Wjdm 

lifte'ri%tocfe . _ \- :"' v 

In- tnls eml3bdim;Bn^ ; the^ coeffTaerits .are:.^ the 

rlfeMir^ 
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speakBDvsrificatbn systsm^ however^thaxcefflri^^ 
. :use$ sfnce^the pitch; o^^ 

Tfte> co eSclerit'srouf put (torn 4he Mehriyjb\o&>3$- aire. each/r^pm sen^etJ iaL® b;| 
binary n^mSS^ 

^ere^cev^oifelsi arid to: store Irje c&ejifl£sft^ 

ftuVri&Br of; tlifs" fbr^eaph ;te^!0fa( teHu'cM Jb^eigrit ^e: ^PU)^ ; fe 

i&fftttftid ltia£tne xflpi^ii' -^64^34^ ^S^uWct#o b e :6i;uVt6.'f$'d .aifrun ci :a. : j^earV 
■. Vsl'ge, With- pccaslcihet oUUler^'And. s^fi res c.dj nglw <?ul'd.'t rCe/eTote>m6^it>in rWp?t ' of 
JtBe ^psl^^a;^ $#^6$ $p*$^tb^^ 

ffti^tty ^^s/ot^^wr^ a nprj^n^^ , 

which- 1^' applied in\ : .thi^^mljd#m > ept ? -jjrt pgrt^euj^ 
• sMeeo^bJt bjritty ypk& end. tr^ V^^^^y^Tt^ : 
; o^t^n^^froro |ij^^jil^^ fte; pan 

be seen from Figure 12, the sigmoid function 111 lies a pbrtT^n li'3 erpund zerd 
which: W substantially linear triis • go nresponds**to the^arearin whfat most oWhe 
<cep3tral coefficients: are to be found, Therefore; the nonriihaar^ 
shown 'in Figure 17 etferilvely.in^ 

•the cepstrol coe'fficTents' which Je away from the ettfeij™*^ 

:■' ^rpcetenlir^-j^ ' .• 

•Ih : atf3r(^^ 

Jjie^eeoh^^ e r^ogpllio fie^ i^tik ,e 

inpuVfranie' £ne^ "ft^n- .fm^prtanj feature., *!«ce£ : $$mWt U&pW ^ 
things; lb; indicaie .Wheth-er of not; the Inp^ tUs'p eeph signal 'd.unng' the ffom'e: 
. . corresponds to a voiced, speech signal, ^described above,-. 4 he- frame energy □/ 
each Input-f fathers c^lculftte.idjh^e^energ/ csSculaiipp/uTqi^g^ in buff P r 

78 shown in' Figure 7, The 'energy for^he cLQ-i^rij'fr^m'ei^pUt by the-buffer 78 is 
tften norm aliVed by me hotm.e1islng^b!o;dC^in ordff toj^ 
by yanablB Pergrding conditions . = " • ";. ; '"' •' • 

;i|lasj|Rate.^he ^p.e:^of.e;jr|ergy ye^.^tio,r^s Which can: &ffec|$b§ 
^^roition; ac^racy^ i^p^icOrd^ Ri^ufes:,. |)3«v :a|^||i:5^«^ 
energy iayelB/ln-yiw.c- iierancesxpMhG sarrte'Wordi f Hejf^^ 
>F%ure ^ ii^:tQOrf.:u%ew^Ber5:w-fth- low bacAgmurid; holle^oi^he^econd^^ 
shown in- Figure :^8b f Ts; a quieter utterance wjfompre baefegrojund noise; dimply; 
usfngVthe energy-calculated : for each utterance by /the ene rgy - calcu fatlon vun ii :: 76 -as j ! 
a re.cc.gnrt)ort : ^ e:sigh1f idarMWfta^^ 

Sdrm ; al|klh^o Ch.et me^eafe^e^0n^oth. ulteteo^^^^ 
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the.m Fsmatch^ 

quieten portions ^e; utterance^ Im orde 

embodiment,: an ^nera^ waskjh^-^ ;: 
described obov.e)^^ 

teed- arrant betoW fee maximum. WM below ir?a maxWnV Tfeie 

and l'*3Tshbwn ; iri FiguVas 18a/ahd;l86/aW^ 

f^festiWir^^ 

12Z whicji Is set In .ab 

npl^qwjti^ 
^he rnpuM%&e;^ 

recejvgd v^ithdftl Wariln^unUllhe-.f nd oHbe; uUerance. Kpw.eVe^ this problem can -be- 
. a;li^n&m;:4^b^ 

has adualjy . stertedj fef* teerdforei likej^^naV t'hff- maximum energy ? hae. keen 
• encountered-^ .tsr.vfirsi regui1redVi"ne 

fbHWing^apptoach to est^ 

: delay energy Jiorm alls ation until -the. start of: speech: has been confirmed and the 
' . ' . srecp^hftfen^e-a/phfls |'ftbo:ui|lo\ be.p|h'j.? - 

. r$e£$i#ev!fc^^ Mfeas^ j[2^';gra^ter'](fjiri;;' 

■ . : $ne sfier) ce efteto$;r " \ ' 'y i' _-'\ [.■''■■' .y' .. ■■ -..;•'•"•'>. 

#0v&3oul^ 

)y) perform rpexlmt^rn WrMsatidn usfng; tb^gfceier bT ifo mi^M .energy 
processing,- rie|ay the .aboVerp'rp.ce^ii'ng. for t Knee, frames .. . . 

;enfer^-Qq^ 

I n.:summaiy, t the preprocss'spr 15'conUpuoUsJy m onitdrs' the input-signaf an d ..when i( 
iienftfieS^heb^ . 
n in e^cspstraJ' coefficient 

%ey-caef^dTe'hUvecib.«^rA:ip8jd^ .V8dbrs^ou4^ut^yJ'h'e' preprocessor Are 'men. ' 

com pare ivfth stored *ete^ 

sysie)n : end:the.ttcpu 
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-ortpufrb^ti •;' V 



A brief das crijatiW b? tha way rh= wHiifch M he worf^m odefecfBscribecl above ere 
,ge nemtB&'Wll I hbw^ 

■ be- used hy nhe ;sy^e4 TheVfpput fd foe' tfanlKg peb'cess ft' r^mfipfe trWng; 
^^pl^ter^d^i^ example- ^ a se'.nes pj feature vectors 

$#d»^ 

slilghjlly Mbre;^ccUi;eie w v 6rd ^o?deli£ 'Tfterik is vetjf. &$&:?J^ 4i|jhg: 

"Th B: training 0lQp1ithm .Tfr9.Uy takes -two ex'^ple.s,-^ iqplits .;,p v ^.8pBjate a first 
wo rd model If more t,han Wo; s^amples. *ra£tq; ^ ; ltsjs tt^^|ri ; ;t 1^0" ^ t^^^t rt^jrt . 
generates a second word model from 4h # t^V*t ; iri^^ ' 
The- lleration conilnUBS until B,:r 
■ •w.ortf , mo tfe^ 

tfther,case.Vhe^ 
.^mpdeorbni ^sfetw^o^xamplesj ' 

Th^finel 8lep3ftv^ 

examples.. This, allghm:8nt pro:ce.s.s Is: pejfdrfted u$ng i4Bxlb^ 
•afenm'eht; pf odess .vhl^r tfbes. n6^:coD?traih ^Bfe^W*i:QpiSft : u■|rt■ aUcjnjp eht jiatri 
jDsitWee^ 

:i^iwiJI- ripfcrbfe 'd^l^^^fi fiwi- • :'V : /. "\' t "£* '"■ 

Figure. ISa illustrates tK'a MHtf jif« a/Tlex&f^ 

process be|Vwed¥h tw^raTnmg^amples;^ and ^3 rf A<|hown in" Figure t§fc 
j^alrun^a^ 

background noise end ^sp'^ 

traming-eXam P Jevl53 Ufrtih'tt pJpj1ionSVj53Ha and ; ;153-lb^t 5 }he ,begjnnmg and vend 
1n ereof corresponding 

153-2: The alignment process ^causes the' praise fra^Bs^auW^be^nhjn^ and'end ^ 
Bach training example i5l and 153 to be matchsd wA -a silence- or noise model 
155 and -the: speech po#tons<i5ir& W'Mfllfr-ke »l|g[rred\wiifi each pthen The 
word model for ^he^spaebr^srtnerv generated by averaajng . 
portion and Which are aligned with each ;o.tW However^he^nboves 

procBssfag ;ceJi cause eixoi^^ 



8/re,hbU consistent In*®* errflDbofa^ 
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(hat -only cd'niista^Wnlritgt examples aroused io generate a- wortf-model 

Trie porisistency c^ 
Inconsistencies beM^^ 

For ekoinple,. WKBh fhB&ser^i^^^^ e^blde'ot'li.i^. 
.bre.ath l tiea^ly [int'pj ttij§ In^c^^t^ beglnn.^ 

possibility : is shown: to yitiih Vhwsi large bsciilaViQhs '4515 at -the 

Thl*,jsMlli£^^ 
the^aecl^pb^ 

: fbpu^.Jbut;^pt tbe^r secoftd^ act' Fiftf^ durirtg ;|tYe Input P.fl^f J^|nJ(i^^|.fTi'Rj(8i |. 

lha tuning «xampla This is (llustjated jn[ Figure I9e^fofch >h'QW"s*ih8^alnl^ Word 
^ith;fpo^ 

.. ftieipr^ 

consistent, ^and -lH hoy are, then- they.are used t'o-generate a raodel ioHhe word 

' "being- faflie : dv i&tefy ■ ire facofletsiEeri$ lb en \it'h& -JoKow^ . .rotas. : ^p L pV^:;Jl|: one : 
^exam^pte is aJraa4yna;. waril nftOo'eJ (farmed; .by iiwo Qtymota, yrti$iii&!$Sfii$ 
. examples) then the other:^ and an" e&&exi^^ • 

ii) If ; bbtb;.->jhe -ekSntgle'*- ate' diredtl^ ftofc Srje feature ii^d^^Sia^fe^rS • 
exarrfpie^ 

aribtfteff^^ ■ 
•lHis' ; •cpnsisVBnt pair W exanffelfeei v^lll be Uge^b ^S|>TrefafW^ wtfrxJ ■r^^|l.; .^d-4hf • , 

hi). If. jhG- $16$^^ ^^^^SeA 
exampj&fe diebrirde^ . ... . 

as the first anfJ second .^examples. Theism ,then Waiter .an qt her example 

A count Is rnade of the number of incon'tejepcjes foundMor each WorpJ Ihfctjs ... 
. . . trained, ^f the nurrtberof inconsistencies exceeds afeB.d\^imU^; Itien ajl further 
inconsistency checkirtg-is turned- off ™ 

:-ge^(inig. siuckl'rr rnf^itej;: Jp'op'^ " ■■' ■ .. 

thai con elstenc^^^^ 
. thevsyslem^^ 

. s»renc» portions,^ 
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thasB>tw.o^values;^n.^^ 

for the. bu'r>eni trarriing e*ar^les- aigrae with, the rtibdefb theh tfie,1wo trairilrig . 

Tfife'-tfr^^ o^crftr^^^ statistics d( : 4h:e.se; r ,tWp 

values ( and wj; for :a;^i§^.^^>f ^ftiffg/^^i^^ wftteti 'tfre kjioyvi} jo -$9 
tonsis^nCThe^^^^ 

- 1^ ^ 

: ^slffcs^ for^their t^nmg\alj 9 nm ? n} fie 'arrf w) li^ wiltyi }h* 9§# ^ability • 
contour pf;^ 
are bejta.^ 

After a |)^r!b>trai &4P/j^ti^t$ -itfo;^fw!lf ^^tkffi'^#]^c^v| a'pcj 

wM6p : 4twjse $Sn1l^ .fix-m^s^fBra^ 'be turad iti* up date % 4he stored modefc "wr-. • 
*eo nslste nt utte ranee s ; Th is *ean*b e> tfon e u sin waxlmu m 1 ikeli ho ocf tesWm ailo n 
.techni'^uB. ' ' ; ;■• '": ... ■:; /y- 

Afiar4he.$y$te 

the input Utterance; . . 

,. . . ^ : tfto>e ^ 
moi^ter^ 

Ai Sjie^rr recognition a£^u£^f^ ®M 
irie^for^ 

With ;Jocal ^ene^'.wlth in th s i o'ceiyea ;S igna< r 
meanTfarvf^ 

ir^qgiBijj^^bel^.a:. predetermined ,fraque;nc^; 
me^ns^Yor mmpaH^ 
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. . rreco^gnltion^resuU; : . ^1 J.* I - ; •' 

An:, apparatus ;accprc(ing^ filtering^ means ;1e pperabliB; v fo 

An tfp^rttus^acfortf ^V#fre%s^ ; 

.<eher^va>'^^^^^ . 

An.'-a^pa^us acWnding ttf claim im^;^ p|ffe|b11|jtp 
pass en erg/ vari;at 10 he w Htctt have; ^a 'Tr.9;q 0| ncy, .of ap p rpc$ I m>te7y- z r 

. . .■ An repp^ra(us ai^of^il^qr S^^r^dTrigrgfatfe wherein, saici' Meting mearif^ ■ 



mm 

^'deteifrth^^^ irif de^|idanc^iip,qri thtj.ra&uljfof said cpm'^arirc^ 

^.appaj^^ r^eans^l 

aad : to determine the, energy of jhe$|lpjut ?i:gfife(i 'tip: 
' :d^^rat&;!tfe'iy^ ■ ■ - ■ 

An apparatus according; fo clatm comprising) <modutation power ^determination 
means for tfetermintng the woo'oiation power erf; the. Tillered Signal within a 
. predetermffted-^ 

power :detS^ 

In dfiscrete portions .of said .'.BnergV„ variation sir--' 



■•j^y^ia^k^^^§M : 9^- fymMM^M- tiMm&M&w a^a.^atd power 

mpd^lattg'ni^^ 

trarisfqnp: 0/ e&ch thscrftV* pprffoh W$P$ eni^y Vernation signal,;, ■ 




means ferAerJn^seid sequence:^* energy value's to remove 'ener.^ variations 
tyftich :have<^ 

meene,:tar 'iieteeiing^h'e presencev-of Kjxeeefi,ffc«^ 

■•en orgy- .^values, .and : «. - 

means.forcorn parlng'trie d fete bled speech: with stored, refer e.nc'ernbd els.Jb prpUde a 

;rec6gnjtuari^.feu](v . J • •• 
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• •• cqnt^ 

. An *ap r p^ro(us ffwr«Sng:io^lm%'-^^erem^soid boandaty defermimng. meana Is 
;pperape4bF^^ 

plurality 6f J possftal q Ibcations- wft Kin- sWfd ^nBr^ s^rrol -^ncl maMst for cfelBrmrfhlng 
'the locdjomwhfc^^ 

M appli^uB; ftft'- : ^^^|fe| flii -tfiSatlBfi Sf &<baufaa$ leiili i|« • 
•1Kb local ene.r^cjy -withirv^he re:c^\ted^ignaii 

means fqr-datermlnin^ the likelihood that satd boundary is (o.cated^at eaph-pf. a 
plurality of ^ps>i£^ v ' 

. ;dete^ 

An.appa/atijs eCc^$.h^i^:9'"\tip>^^iFi> 1^ -^-^^S-ife^-S '"S^^SS^^ 5 i_ 

Operate, to determine: tK'0 ijKeli.h^pd th.at 8a'|d ; -fco^d^ is located ;af-ieacko^aiji 
, possibte'lo cations- b^^^ aiciM? 
tWcurrsnt' location bnctfgroundW% 
(il) 1 c»mpori]ng : .\the< : p.qrtlon .ol the <Gnergy^s'lgnaI k Oft •> the *othar fritfe«^?4h.fe' current 
rocatfbn^wjih AWtfcfel >*p.i?.s£nta# §»! contfbfnfngt , 

An app^attis^ 6^:^ : ' : 

is^perabJe'^Q pfocefc ^al^GelvQd ^ to Heniify-WneU^pCacnM^ pre&nt : 

in the, receded ^gn^l^W^Breln' sajd liKeJih'oodi tWi^M^^^^^^^P^M 
def^in^ 

• ;AK s appar^us-^ccb^ih^ tq\ $m 
•energy ajgaal ;tq tem^e ^g«tj[Q^ whfdjr fi$tfgfc ^S8(tS£ Mp* %i 

.predetermine"^ 

. An,appar ; atu :? a^rdln^^o^:^^ *| ;^|rtf n,:s^^ 
out energy varjatipns bejpw JHz, 

. An -appara^^ ; ^ 

' '• 
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. - ■ .■ k\ ■■'\ ^M^^li-p 

operebMo dk/itiM^ 
and 4o determine #he> an 

An op^mue.Qi6cQndjng to~ t cl*frn iz Vhen .dep.eadent.i/ponxiorm 16; vv herein, *^ 
' : ftlte/e:8 :er3"&r|y stpjdi;-/' ~:':\\". ; *;. ' : - '■ . •.»;•..•.'; 

^p$&^ \.'z' ; : -'"V -:: :,f -';;f j 

: An ^'p anatq s: ^ccc?rtl( hg t&>d$ft-.-2§ w^e^^f) models are. h^e.d on %pl^ia:n 

i/iregrff&sivevmo'deir.. i ■ ' ■ i ■ . ■ -'V : 

A^spBech;recqgn^^ 
processing!^ 

.erief^ -within the tecelved signal; 

^ifib.QfsSS'd Srte'rtjy &i'gpal jfeb: VteViriibve »tte:egy -ySniMQn^^hiGb have e< frequency 
• . -below* r#B'ej(e1riii fiiecl^rerqii^'nx^ • •- /. ■ 

jtie^^ 

, . : $snal; and • : . : . - t-ys '*"•'>' ' " : .; ■:. " '. . = : \ ; ; 
cbrrii*^ ' 

A m ethod ac^r^ 

,A method -accord^ 
variations wJiT^ 

A method according \to any ofxlatme.^^^ 
compotes ^ald^ 

presence of. speech • In dtfpSbtertee uponjhe resylVof^ 
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25] 



.;s^ft4 "•■.vij-v :: • " ;". ' ; . : -'V '': ' ' /y \ 



.-Av'irtilhod 



according to claim >3 1 ^ w tfersiir said fiRetlnjg step -and sale" modulation 
pwaftdetfcrm mfng.iitep aFa^op erefeteato - flitfe rcantf dateim inev*He*m adulation pow.a r jh 
'dfecrete portions ofuald energy 

A '^'-eV&otf bcgordmjg/b/datrn £.2 w.HeftJn laltf .tl(te^g^fiW&^d $ajft ; pd^eft 
modulation- determrniog 8te'|j,;determ:m8 the first hon-£>C; co'efflCs eri^of- fe..tiis)2rel8 
Fptfnar transform? orbacfr discrete* portion of' said enef^y.yanafi 

A^jwttl j^co^ritftoo^ 
frames e"ach representative of a pdrtigh of -art input signal, 

pr^cejssl^ i^^jyen^e;' 
of energy values' ind^cajl^.df Die Joe^ energy Within the represrentafWe signal- : 

fijterjn#:0$^ hjap' oj 

d^e;!ctlwp:.'jE^: ••q| ;5 i^jtf.^|ri|- s^i^lnput signal gaing .eeijcl filtered, ehergy: 
^Ues^i^and; . : : : : . ' ■ ■ " • \ . m . • ' ' ■ ; 

compaiingiWerfetecte^^ 



A- method; according to any o^ .elafms '25M0 3% fu rther vcom pjisip g tjhe .siep^: pf. 
determining the boundary b.ejween a speech containing portion arid a background 

vh6js)e bpnl^ifig^ 



the likelihood nh& m country Js located at each S8 MWr&.ty ttt powWa 
portion 6Ht< Jfttfacfcg^ stf£s<*r) • ^fie 



rnethppy : ;cxmi:pjis1 



determining- ^he likelihood^Katvsald' bounda^yi* located . a* e a ch o^^piurajigt-dif; 
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. ; :pOB.«ibrev"locatfons j'wltfftrv; ^a^enei^^sl^Ba^ and (:•' ■ . 

de t e nm mi no^th e • I ocatlo n>ofr said boundary using said likelihoods- determlne'dlforjeoch 

&k*Ml:a^^ 

cdrrvpflMog rf'poSo^^ $ 
rrioijel repriB;s0;nV^t^^ th^ 

the energy signal- oW^he^B^^e ^3&-1&q4P .wljfl % Motipi/ . 

cpmpan^pfy* 

A/methq.d^c^cJjng ^claim 3 ?'o? '3 8 -ram posing; (l.e|:M.^-|lep 
s^gnaUand^w^ 

the received signalmen ' s ^d speech detecting step' ctetecte speech within ( the 
.receded: elgneli :-' : -\ ^\?'-^[^'.'. - \ • ;.; : . ; " ' ■•' - >: 

A mBtlrwd!-tfccorbih.'s^t : d ciaim ^ f art her^com p rising the; sfcpr*ofrfiiterfn]£ said energy . 
signal tof-remove energy venations Which have, a^requency below, .a pre.delermtned 
.. frequency : . : : : "V" ! . : V 

A rnl'tfta3,;Itc$WI'riS to: claim* ^C^netSfh Witi W$i&^ 'ehergj/ 

• '-yari^io.^ , \ : ' 

ilfe^pfa^ 

energybf thetnpuT signal in each" <^l£jg I tim e 'fames fo^^rate'a^jsc^etd energy, 

A* rr^tbp^fmo^o; Jj^r^ ■p$Wi%?M*$$ $W 

fifteiriiVg-s^ 

■ ■ , '. '"":,! ■: /'.." ;. [. : ' ." . . : : ;■ .' : .. ':, .. \ I .■ : 

A m«thodiacco^ing\tb)plajm 4^ v/hejTein sa|d lil^ihoti 
said like! |h0od;fbr^e 

A method acpofdirfg *io any Qf.^airns 37 1ov*H wherein a aid boundary la at" the 

• begSnriingfonatn^ 

A meihod according-to^ / 
."• :mbdeisk.are^siafistjc^^ 

•A : : jn^hoa:.-.occorc^h5^JO'ACla1 m: 4$ ow fie rein;* alt! models :are based; pr^ Ua^iaclan 
; isVaiis'trcs; ' .'.•,'.«. . : ; ! _ '•■} • ' . " : • .:■ 
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v*C^oi^ge :.|rf¥gjg^ J£$$|#^#^^ controlling © 

ProcesBOP^pten^ ©^processor io^inijidem:eh^e 

;methrod'-d^ony-.one.'df cli'irrt s.*"-25 : -^o 
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